Audio-Based Event Detection in Videos - a Comprehensive Survey
Abstract
Applications such as video classification, video summarization, video retrieval, and highlight extraction need to discriminate among the activities occurring in a video. Such activity or event detection in videos has significant consequences for security, surveillance, entertainment, and personal archiving. Typical systems focus on visual cues. Audio cues, however, contain rich information that can be used effectively for event detection, and Multimedia Event Detection (MED) could benefit from the attention of researchers in audio analysis. Many audio-based event detection methods have been proposed for specific applications, while others are generic. This survey presents an exhaustive review of efforts in past years to address the use of audio-based cues in video event detection. It summarizes existing methods in terms of the audio features and modeling techniques they use. We hope to provide a good understanding of the different directions in this field of research.
Keywords: Event detection, Highlight Extraction, Gaussian Mixture Model, Support Vector Machine
Similar resources
What You Hear Is What You Get: Audio-Based Video Content Analysis
Audio-based video event detection on user-generated content (UGC) aims to find videos that show an observable event, such as a wedding ceremony or a birthday party. In a lower tier, audio concept detection aims to find a sound or concept, such as music, clapping or a cat’s meow. Different events are described by different sounds. The difficulty of video content analysis on UGC lies in the lack ...
A Comparison of Rule based and Distance Based Semantic Video Mining
In this paper, a subspace-based multimedia data mining framework is proposed for video semantic analysis, specifically video event/concept detection, by addressing two basic issues, i.e., semantic gap and rare event/concept detection. The proposed framework achieves full automation via multimodal content analysis and intelligent integration of distance-based and rule-based data mining technique...
Detection of goal events in soccer videos
In this paper, we present an automatic extraction of goal events in soccer videos by using audio track features alone without relying on expensive-to-compute video track features. The extracted goal events can be used for high-level indexing and selective browsing of soccer videos. The detection of soccer video highlights using audio contents comprises three steps: 1) extraction of audio featur...
Audio self organized units for high-level event detection
High-level multimedia event detection aims to identify videos containing a target event. Recent approaches leveraging audio information for this task fall into two broad categories. The first corresponds to holistic bag-of-words approaches based on frame-level descriptors. These are effective for classification, but hard for humans to interpret. The second corresponds to approaches that build a...
Robust audio-codebooks for large-scale event detection in consumer videos
In this paper we present our audio-based system for detecting “events” within consumer videos (e.g. YouTube) and report our experiments on the TRECVID Multimedia Event Detection (MED) task and development data. Codebook or bag-of-words models have been widely used in text, visual and audio domains and form the state-of-the-art in MED tasks. The overall effectiveness of these models on such dat...